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BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates in general to the field of information handling 
systems and, more particularly, to an improved method and apparatus for storing and 
disseminating software using a cache memory on a factory server used in the 
manufacturing of information handling systems. 

Description of the Related Art 

As the value and use of information continues to increase, individuals and 
businesses seek additional ways to process and store information. One option 
available to users is information handling systems. An information handling system 
generally processes, compiles, stores, and/or communicates information or data for 
business, personal, or other purposes, thereby allowing users to take advantage of the 
value of the information. Because technology and information handling needs and 
requirements vary between different users or applications, information handling 
systems may also vary regarding what information is handled, how the information is 
handled, how much information is processed, stored, or communicated, and how 
quickly and efficiently the information may be processed, stored, or communicated. 
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The variations in information handling systems allow for information handling 
systems to be general or configured for a specific user or specific use such as financial 
transaction processing, airline reservations, enterprise data storage, or global 
communications. In addition, information handling systems may include a variety of 
5 hardware and software components that may be configured to process, store, and 
communicate information and may include one or more computer systems, data 
storage systems, and networking systems. 

In recent years, there has been an increase in the number of information 
handling systems that are manufactured based on a "build to order" process that 

10 allows a customer to specify specific hardware and software options. In "build to 
order" manufacturing systems, the operating system and other software files are 
stored as groups of compressed ("ZIP") files that are organized according to software 
part numbers (sometimes referred to herein as "SRVs"). Each of the software part 
numbers is assigned a shell script (top script), and each of these scripts contains the 

1 5 names of corresponding ZIP files to download. A typical operating system file is 
approximately 500-650 megabytes, while a base image size is approximately 1-2 
gigabytes. Different operating systems, multiple language versions and their related 
service packs can add up to several gigabytes of memory. These large file sizes cause 
problems with regard to the size of the cache required in a factory server used for 

20 manufacturing. In addition, the larger files require more download time, which in 
turn, results in longer "burn" time for each system in the manufacturing process. 

In view of the foregoing, there is a need for a method and apparatus to 
minimize the size of files that must be stored in the cache of a server used in a "build 
to order" process for manufacturing information handling systems. Such a system is 

25 provided by the present invention, as described hereinbelow. 
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SUMMARY OF THE INVENTION 

The present invention overcomes the shortcomings of the prior art by 
providing a method and apparatus for minimizing the size of the cache that is required 
to store software packages on a factory server for installation on an information 
5 handling system manufactured using a built-to-order system. In an embodiment of 
the invention, a plurality of software applications are received and are disassembled to 
separate the individual constituent program files. These program files are 
decompressed and are stored in temporary file directories. Program files that are 
common to each of the software applications are identified and indexed. After the file 

10 analysis, the files are regrouped and a stored as a new set of compressed (ZIP) files. 
Once the new set of ZIP files has been created, the top level scripts for the software 
application images are updated for use with the new ZIP files to allow the desired 
combinations of software applications to be regenerated. The new ZIP files are then 
loaded into the factory servers for use in manufacturing information handling systems. 

1 5 The method and apparatus of the present invention overcomes the 

shortcomings of the prior art by minimizing the amount of cache memory needed for 
the factory server used in the build to order system. In addition, the present invention 
significantly reduces the amount of "burn" time needed in the manufacturing of 
information handling systems, thereby resulting in increased throughput for the 

20 factory. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention may be better understood, and its numerous objects, 
features and advantages made apparent to those skilled in the art by referencing the 
accompanying drawings. The use of the same reference number throughout the 
5 several figures designates a like or similar element. 

Figure 1 is a general illustration of an automated build-to-order system for 
installing software on an information handling system. 

Figure 2 is a system block diagram of an information handling system. 

Figure 3 is an illustration of a software application comprising a plurality of 
10 program files. 

Figure 4 is an illustration of a plurality of decompressed program files after 
disassembly of the software applications illustrated in Figure 3. 

Figure 5A is an illustration of a software image file comprising a plurality of 
reassembled program files. 
1 5 Figure 5B is an illustration of a software image file comprising the original 

software applications containing a plurality of redundant program files. 

Figure 6 is an illustration of an embodiment of an automated system for 
analyzing a plurality of software systems and for generating a plurality of base images 
that requires minimal storage space in the cache of a factory server. 
20 Figure 7 is a flowchart illustration of the processing steps for implementing 

the method of the present invention for reducing the amount of cache storage needed 
in a factory server. 
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DETAILED DESCRIPTION 

Figure 1 is a schematic diagram of a software installation system 100 at an 
information handling system manufacturing site. In operation, an order 1 10 is placed 
to purchase a target information handling system 120. The target information 
5 handling system 120 to be manufactured contains a plurality of hardware and software 
components. For instance, target information handling system 120 might include a 
certain brand of hard drive, a particular type of monitor, a certain brand of processor 
and software. The software may include a particular version of an operating system 
along with all appropriate driver software and other application software along with 

10 appropriate software bug fixes. Before target information handling system 120 is 
shipped to the customer, the plurality of components are installed and tested. Such 
software installation and testing advantageously ensures a reliable, working 
information handling system which is ready to operate when received by a customer. 
Because different families of information handling systems and different 

15 individual computer components require different software installation, it is necessary 
to determine which software to install on a target information handling system 120. A 
descriptor file 130 is provided by converting an order 1 10, which corresponds to a 
desired information handling system having desired components, into a computer 
readable format via conversion module 132. 

20 Component descriptors are computer readable descriptions of the components 

of target information handling system 120 which components are defined by the order 
110. In an embodiment of the present invention, the component descriptors are 
included in a descriptor file called a system descriptor record which is a computer 
readable file containing a listing of the components, both hardware and software, to 
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be installed onto target information handling system 120. Having read the plurality of 
component descriptors, database server 140 provides a plurality of software 
components corresponding to the component descriptors to file server 142 over 
network connection 144. Network connections 144 may be any network connection 
5 well-known in the art, such as a local area network, an intranet, or the internet. The 
information contained in database server 140 is often updated such that the database 
contains a new factory build environment. The software is then installed on the target 
information handling system 120. The software installation is controlled by a 
software installation management server, discussed in greater detail below, that is 

10 operable to control the installation of the operating system and other software 
packages specified by a customer. 

Figure 2 is a generalized illustration of an information handling system, such 
as the target information handling system 120 illustrated in Figure 1. The information 
handling system includes a processor 202, input/output (I/O) devices 204, such as a 

15 display, a keyboard, a mouse, and associated controllers, a hard disk drive 206, and 
other storage devices 208, such as a floppy disk and drive and other memory devices, 
and various other subsystems 210, all interconnected via one or more buses 212. The 
software that is installed according to the versioning methodology is installed onto 
hard disk drive 206. Alternately, the software may be installed onto any appropriate 

20 non-volatile memory. The non-volatile memory may also store the information 
relating to which factory build environment was used to install the software. 
Accessing this information enables a user to have additional systems corresponding to 
a particular factory build environment to be built. 

For purposes of this disclosure, an information handling system may include 

25 any instrumentality or aggregate of instrumentalities operable to compute, classify, 
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process, transmit, receive, retrieve, originate, switch, store, display, manifest, detect, 
record, reproduce, handle, or utilize any form of information, intelligence, or data for 
business, scientific, control, or other purposes. For example, an information handling 
system may be a personal computer, a network storage device, or any other suitable 
5 device and may vary in size, shape, performance, functionality, and price. The 
information handling system may include random access memory (RAM), one or 
more processing resources such as a central processing unit (CPU) or hardware or 
software control logic, ROM, and/or other types of nonvolatile memory. Additional 
components of the information handling system may include one or more disk drives, 

10 one or more network ports for communicating with external devices as well as various 
input and output (I/O) devices, such as a keyboard, a mouse, and a video display. The 
information handling system may also include one or more buses operable to transmit 
communications between the various hardware components. 

Figure 3 is a conceptual illustration of a plurality of software application 

1 5 images 302, 304 and 306, each of which comprises a plurality of program files. For 
purposes of discussion herein, a software operating system or other high level 
software file, such as a word processor, will be referred to as a "software application," 
while the plurality of files that are contained within the software application will be 
referred to as the "program files." Each of the software application images illustrated 

20 in Figure 3 comprise a plurality of program files illustrated by the reference letters A, 
B, G. Each of the software applications 302, 304, and 306 are comprised of 
various combinations of some, but not all, of the program files A, B, . . ., G. As will 
be understood by those of skill in the art, a library of program files A, B, . . ., G 
contains all of the files that are necessary to reconstruct the software applications 302, 

25 304, and 306. 
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Typically, the software applications 302, 304, and 306 are supplied by vendors 
in a compressed format, such as a "ZIP" file, wherein the various program files are 
compressed. In the method and apparatus of the present invention, the software 
applications, 302, 304, and 306 are initially disassembled and each of the program 
5 files are decompressed or "un-ZIPped." 

Figure 4 is a conceptual illustration of the disassembled and decompressed 
program files A, B, . . ., G wherein each of the program files represents an independent 
program file that can be used to reassemble a software application. These individual 
program files are stored in temporary directories and are then indexed and analyzed in 
10 a software dissemination server, discussed hereinbelow in Figure 6, to generate an 

index of program files representing all of the program files that are needed to generate 
a family of software applications, such as the software applications 302, 306 and 308 
illustrated in Figure 3. 

Figure 5A illustrates a composite program file image library 308 that contains 
15 all of the program files A, B, .. ., G that are necessary to create the original software 
applications, 302, 304, and 308. The composite program file image library contains 
one copy of each of the unique program files; however, all redundant copies of the 
program files have been removed. The program file image library 308 can be stored 
in a program storage cache in a factory server, as discussed hereinbelow, to allow 
20 improved efficiency in the manufacture of information handling systems. 

Figure 5B is a comparative illustration of the relative size of the combined 
software applications 302, 304, and 306 that would be stored in a program storage 
cache in a factory server. As can be seen, the amount of storage (illustrated by 5A) 
required to store each of the software applications 302, 304, and 306 is substantially 
25 larger than the amount of storage (illustrated by 5B) that is required to store the 
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composite program file image library 308 that contains the individual program files 
A, B, G. 

Figure 6 is an illustration of the components of the automated system for 
converting, optimizing and disseminating software in accordance with the present 
5 invention. Software applications 302, 304 and 306 are delivered to the system via a 
firewall 606 and are received by a software dissemination server 608. When a new 
software application is received, the software dissemination server 608 unpacks the 
application file into individual program files. The software dissemination server 608 
scans the software applications for viruses and then transfers the packages to the 

10 repack and script regeneration server 612. The repack and script regeneration server 
612 is operable to disassemble the software applications 302, 304, and 306 into 
individual program files. In this process, the server 612 disassembles and 
decompresses the software applications 302, 304, and 306 into the individual program 
files, such as the program files A, B, and G discussed hereinabove in connection 

15 with Figures 3-6 and stores these individual program files in temporary directories. 
The repack and script regeneration server 612 compares the individual 
program files with the existing files in the composite program library indexed and 
stored in the archive server 614. If an individual program file already exists in the 
composite program file library, that file is discarded to reduce the redundancy. The 

20 repack and script regeneration server 612 establishes a reference in the shell script for 
that software application to the file in the composite program file library. That file is 
already available on the download server 616 as well. 

The repack and script regeneration server 612 also constructs an index of the 
program files contained in each of the software applications 302, 304, 306. This 

25 index and the related program files are used to construct a composite program file 
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image library that is transferred to the download server 616, which includes a 
software image cache 617. The composite program file image library contains one 
copy of each of the unique program files but, as discussed herein, all of the redundant 
copies of the program files have been removed. The program files, e.g., program files 
5 A, B, . . ., G, assembled into the composite program file library 308, as discussed 

hereinabove, are stored in the software image cache 617 to decrease the time required 
to access these files, thereby improving the performance of the "build to order" 
system. 

The repack and script regeneration server 612 generates top level factory 

10 scripts for each of the program files to produce "factory installable bits" 

corresponding to the various software applications. These "factory installable bits" 
are then transferred to the download server 616. Copies of the program file images 
are also transferred to an archive server 614. The repack and script regeneration 
server 612 then generates a signal authorizing the script and installation validation 

15 server 61 8 to generate appropriate commands to control downloading of software 
application to the target information handling system 120. The results of the 
installation are monitored by the server 618 and results are communicated to the 
software dissemination server 608 while the actual software images are downloaded 
by the download server 616 onto the hard drive or other storage media of the target 

20 information handling system 1 20. 

After the software images have been downloaded to the target information 
handling system 1 20, tests are performed by a test validation sever 620 which 
performs a series of tests to confirm that the software images have been properly 
installed and that the operational integrity of the software package is satisfactory. The 

25 results of the test performed by the test validation server 620 are communicated to the 
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software dissemination server 608 which is operable to generate a status report for 
viewing by a program manager 622. 

Figure 7 is a flowchart illustration of the processing steps implemented by the 
system of the present invention. In step 702, the source software applications 
5 containing the compressed program files are entered into the system. In step 704, the 
source software applications are disassembled to obtain the individual program files. 
In step 706, the program files are decompressed (un-ZIPped) and stored in temporary 
directories. In step 708, the decompressed program files are analyzed and in step 710 
an index of the unique program files is created and redundant copies of the program 

10 files are eliminated. In step 712 the repack and regeneration script server 612 
generates compressed program files and top level scripts which, in step 714, are 
stored on the download server 616. In step 716, the compressed archive files are 
stored on the archive server 614. In step 718, the factory script is validated and 
installation of the software files is authorized. In step 720, software images are 

1 5 downloaded to the target information handling system. 

Testing of the software can be accomplished using the system and methods 
described in co-pending application Serial Number 10/267,513, filed on October 9, 
2002, entitled "Method and System for Test Management," which by this reference is 
incorporated for all purposes. 

20 Other Embodiments 

Other embodiments are within the following claims. 

For example, the above-discussed embodiments include software modules that 
perform certain tasks. The software modules discussed may include script, batch, or 
other executable files. The software modules may be stored on a machine-readable or 
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computer-readable storage medium such as a disk drive. Storage devices used for 
storing software modules may be magnetic floppy disks, hard disks, or optical discs 
such as CD-ROMs or CD-Rs, for example. A storage device used for storing 
firmware or hardware modules may also include a semiconductor-based memory, 
which may be permanently, removably or remotely coupled to a microprocessor 
memory system. Thus, the modules may be stored within a computer system memory 
to configure the computer system to perform the functions of the module. Other new 
and various types of computer-readable storage media may be used to store the 
modules discussed herein. Consequently, the invention is intended to be limited only 
by the spirit and scope of the appended claims, giving full cognizance to equivalents 
in all respects. 

Although the present invention has been described in detail, it should be 
understood that various changes, substitutions and alterations can be made hereto 
without departing from the spirit and scope of the invention as defined by the 
appended claims. 
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